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Enzymatic synthesis of deoxyribonucleosides 



Description 

5 

The present invention relates to a method for the in vitro enzymatic 
synthesis of deoxyribonucleosides and enzymes suitable for this method. 

Natural deoxyribonucleosides (deoxyadenosine, dA; deoxyguanosine, dG; 
10 deoxycytidine, dC and thymidine, dT) are building blocks of DNA. The N- 
glycosidic bond between nucleobase and sugar involves the N 1 of a 
pyrimidine or the N 7 of a purine ring and the C n of deoxyribose. 




In the living cells the four deoxyribonucleosides (dN) result from the 
"salvage pathway" of nucleotide metabolism. A group of enzymes is 
involved in cellular catabolism of deoxyribonucleosides. Besides 
deoxyriboaldolase (EC 4.1.2.4) and deoxyribomutase (EC 2.7.5.1), this 
30 group also includes thymidine phosphorylase (EC 2.4.2.4) and purine 
nucleoside phosphorylase (EC 2.4.2. 1 ). These four nzymes are induced by 
the addition of deoxyribonucleosides to the growth medium. The genes 
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coding for these enzymes have been shown to map closely together on the 
bacterial chromosome (Hammer-Jesperson and Munch-Peterson, 
Eur.J.Biochem.17 (1970), 397 and literature cited therein). In E.coli the 
genes as described above are located on the deo operon which exhibits an 
unusual and complicated pattern of regulation (Valentin-Hansen et aL, 
EMBO J.1 (1982), 317). 

Using the enzymes of the deo operon for synthesis of deoxynucleosides 
was described by C.F.Barbas III (Overproduction and Utilization of Enzymes 
in Synthetic Organic Chemistry, Ph.D. Thesis (1989), Texas A&M 
University). He applied phosphopentomutase and thymidine phosphorylase 
for the synthesis of deoxynucleosides. Deoxyribose 5-phosphate was 
prepared by chemical synthesis (Barbas III et aL, J.Am.Chem.Soc. 112 
(1990), 2013-2014), which makes this compound expensive as starting 
material and not suitable for large scale synthesis. He also made 
deoxyriboaldolase available as a recombinant enzyme and investigated its 
synthetic applicability but neither he nor C.-H.Wong (Microbial Aldolases in 
Carbohydrate Synthesis: ACS Symp.Ser.No.466: Enzymes in Carbohydrate 
Synthesis, Eds. M.D.Bednarski, E.S.Simon (1991), 23-27) were able to 
carry out a coupled one-pot synthesis employing all three enzymes. It 
appears likely that some drawbacks exist which could not be circumvented. 
Among these drawbacks are insufficient chemical equilibrium, instability of 
intermediates, such as deoxyribose 1 -phosphate and inactivation and 
inhibition effects of involved compounds on the enzymes. 

Evidence of an advantageous equilibrium is given by S.Roy et al. (JACS 1 08 
(1 986), 1 675-78). For the aldolase reaction the equilibrium is on the desired 
product side (deoxyribose 5-phosphate), for the phosphopentomutase it is 
on the wrong side (also deoxyribose 5-phosphate) and for the purine 
nucleoside phosphorylase it is on the desired synthesis product side. Th 
authors suggest coupling of the three enzyme reactions to obtain reasonable 
yields. Contrary to these suggestions they prepared deuterated 
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deoxyguanosine and thymidine in a two step procedure, that is deoxyribose 
5-phosphate in a first step and deoxynucleoside in a second step. Isolated 
yields of the second step were 11% and 5% for deoxyguanosine and 
thymidine, respectively. These low yields are also obtained in the 
5 preparation of arabinose-based nucleosides (Barbas III (1990), supra). 

These low yields indicate serious drawbacks for the use of the enzymes of 
the deo operon in a synthetic route which have to work in the reverse 
direction of their biological function, which is degradation of 
deoxynucleosides. 

10 

Thus, there does not exist any economical commercial method at present ^ 
for the enzymatic in vitro synthesis of deoxyribonucleosides. Hitherto, for 
commercial purposes, deoxynucleosides are generated from fish sperm by 
enzymatic cleavage of DNA. This method, however, involves several 
15 disadvantages, particularly regarding difficulties of obtaining the starting 
material in sufficient quantity and quality. 

Therefore, it was an object of the invention to provide a method, by means 
of which the drawbacks of the prior are eliminated at least partially and 
20 which allows efficient and economical synthesis of deoxyribonucleosides 
without any dependence on unreliable natural sources. 

Surprisingly, it was found that the drawbacks of previous enzymatic 
synthesis routes can be avoided and deoxyribonucleosides can be obtained 
25 in high yields of e.g. at least 80% based on the amount of starting material. 

In a first aspect, the present invention relates to a method for the in vitro 
enzymatic synthesis of deoxyribonucleosides comprising reacting 
deoxyribose 1 -phosphate (dRIP) and a nucleobase, wherein a 
30 deoxyribonucleoside and inorganic phosphate are formed. 
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The reaction is catalyzed by an enzyme which is capable of transferring a 
deoxyribose moiety to a nucleobase, with a deoxyribonucleoside being 
formed. Preferably, the reaction is catalyzed by a thymidine phosphorylase 
(TP, EC 2.4.2.4) or a purine nucleoside phosphorylase (PNP, EC 2.4.2.1). 
5 For the EC designation of these enzymes and other enzymes mentioned 
below reference is made to the standard volume Enzyme Nomenclature 
1992, Ed. E.C.Webb, Academic Press, Inc. 

These enzymes and other enzymes mentioned below are obtainable as 
10 native proteins from natural sources, i.e. any suitable organisms selected 
from eukaryotes, prokaryotes and archaea including thermophilic organisms. 
Further, these enzymes are obtainable as recombinant proteins from any 
suitable host cell which is transformed or transfected with a DNA encoding 
said enzyme. The host cell may be a eukaryotic cell, a prokaryotic cell or an 
15 archaea cell. Particular preferred sources of native or recombinant TP or 
PNP are prokaryotic organisms such as E.coli. Recombinant TP may be 
isolated from E.coli strain pHSP 282 (CNCM 1-2186) deposited on April 23, 
1999, which is a recombinant E.coli strain transformed with a plasmid 
containing the E.coli deoA (thymidine phosphorylase) insert. Recombinant 
20 PNP may be isolated from E.coli strain pHSP 283 (CNCM 1-21 87) deposited 
on April 23, 1 999, which is a recombinant E.coli strain transformed with a 
plasmid containing the E.coli deoD (purine nucleoside phosphorylase) insert. 
The nucleotide sequence of the TP gene and the corresponding amino acid 
sequence are shown in SEQ ID N0.1 and 2. The nucleotide sequence of the 
25 PNP gene and the corresponding amino acid sequence are shown in SEQ ID 
NO. 3 and 4. 

The nucleobase, to which the deoxyribose unit is transferred, will be 
selected from any suitable nucleobase. For example, the nucleobase may 
30 be a naturally occurring nucleobase such as thymine, uracil, adenine, 
guanine or hypoxanthine. It should be noted, however, that also non- 
naturally occurring analogs thereof are suitable as enzyme substrates such 
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as 2-thio-uracil, 6-aza-uracil, 5-carboxy-2-thiouracil, 6-aza-thymine, 6-aza-2- 
thio-thymine and 2,6-diamino-purine. 



Preferably the inorganic phosphate is removed from the reaction. This 
removal is preferably effected by (i) conversion to inorganic pyrophosphate, 
(ii) precipitation/complexation and/or (iii) substrate phosphorylation. 



Conversion to inorganic pyrophosphate may be effected by a phosphate 
transfer from a phosphorylated, preferably polyphosphorylated substrate 

10 such as fructose diphosphate (FDP), wherein a phosphate group is cleaved 
from the phosphorylated substrate and reacts with the inorganic phosphate, 
with inorganic pyrophosphate (PPi) being formed. This phosphate transfer 
is preferably catalyzed by a PPi-dependent phosphorylase/kinase, e.g. by a 
PPi-dependent phosphofructokinase (PFK-PPi, EC 2.7.1.90), which catalyzes 

15 the reaction of fructose diphosphate (FDP) and inorganic phosphate to 
fructose 6-phosphate (F6P) and inorganic pyrophosphate. Preferred sources 
of PPi-dependent kinases/phosphorylases and genes coding therefor are 
from Propionibacterium freudenreichii (shermanii) or from potato tubers. 



20 Further, the inorganic phosphate may be removed from the reaction by 
precipitation and/or complexation which may be effected by adding 
polyvalent metal ions, such as calcium or ferric ions capable of precipitating 
phosphate or by adding a complex-forming compound capable of 
complexing phosphate. It should be noted that also a combination of 

25 pyrophosphate formation and complexation/ precipitation may be carried 
out. 



Furthermore, the removal of inorganic phosphate may be effected by 
substrate phosphorylation. Thereby the inorganic phosphate is transferred 
30 to a suitable substrate, with a phosphorylated substrate being formed. The 
substrate is preferably selected from saccharides, e.g. disaccharides such 
as sucrose or maltose. When using disaccharides as substrate, a 
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monosaccharide and a phosphorylated monosaccharide are obtained. The 
phosphate transfer is catalyzed by a suitable phosphorylase/kinase such as 
sucrose phosphorylase (EC 2.4. 1 .7) or maltose phosphorylase (EC 2.4.1 .8). 
Preferred sources of these enzymes are Leuconostoc mesenteroides 
(sucrose phosphorylase) and Lactobacillus brevis (maltose phosphorylase). 

The phosphorylated substrate may be further reacted by additional coupled 
enzymatic reactions, e.g. into a galactoside (Ichikawa et al., Tetrahedron 
Lett.36 (1995), 8731-8732). Further, it should be noted that phosphate 
removal by substrate phosphorylation may also be coupled with other 
phosphate removal methods as described above. 

Deoxyribose 1-phosphate (dR1P), the starting compound of the method of 
the invention, is a rather unstable compound, the isolation of which is 
difficult. In a preferred embodiment of the present invention, d1RP is 
generated in situ from deoxyribose 5-phosphate (dR5P) which is relatively 
stable at room temperature and neutral pH. This reaction is catalyzed by a 
suitable enzyme, e.g. a deoxyribomutase (EC 2.7.5.1) or a phosphopentose 
mutase (PPM, EC 5.4.2.7) which may be obtained from any suitable source 
as outlined above. The reaction is preferably carried out in the presence of 
divalent metal cations, e.g. Mn 2+ or Co 2+ as activators. Preferred sources 
of deoxyribomutase are enterobacteria. Particular preferred sources of 
native or recombinant PPM are prokaryotic organisms such as E.coli. 
Recombinant PPM may be isolated from E.coli strain pHSP 275 (CNCM I- 
2188) deposited on April 23, 1999, which is a recombinant E.coli strain 
transformed with a plasmid containing the E.coli deo B (phosphopentose 
mutase) insert. The nucleotide sequence of the PPM gene and the 
corresponding amino acid sequence are shown in SEQ ID NO. 5 and 6. 

dR5P may be generated by a condensation of glyceraldehyde 3-phosphat 
(GAP) with acetaldehyde. This reaction is catalyzed by a suitable enzyme, 
preferably by a phosphopentose aldolase (PPA, EC 4.1 .2.4). The reaction 
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exhibits an equilibrium constant favorable to the formation of the 
phosphorylated sugar (K eq = [dR5P]/[acetaldehyde] x [GAP] = 4.2 x 10 3 x 
M" 1 ). PPA forms an unstable Schiff base intermediate by interacting with the 
aldehyde. Particular preferred sources of native or recombinant PPA are 

6 prokaryotic organisms such as E.coli. Recombinant PPA may be isolated 
from E.coli strain pHSP 276 {CNCM 1-2189) deposited on April 23, 1999. 
This recombinant E.coli strain is transformed with a plasmid containing the 
deoC (phosphopentosealdolase) insert. The nucleotide sequence of the PPA 
gene and the corresponding amino acid sequence are shown in SEQ ID 

10 NO. 7 and 8. 

GAP is a highly unstable compound and, thus, should be generated in situ 
from suitable precursors which are preferably selected from fructose 1,6- 
diphosphate (FDP), dihydroxyacetone (DHA) and/or glycerolphosphate (GP), 
15 with FDP being preferred. 

FDP can be converted by an FDP aldolase (EC 4.1 .2.1 3) selected from FDP 
aldolases I and FDP aldolases II to GAP and dihydroxyacetone phosphate 
(K eq = [FDP1/IGAP] x [DHAP] = 10 4 M 1 ). The two families of FDP aldolases 

20 giving identical end products (GAP and DHAP) via two chemically distinct 
pathways may be used for this reaction. FDP aldolase I forms Schiff base 
intermediates like PPA, and FDP aldolase II which uses metals (Zn 2+ ) 
covalently bound to the active sites to generate the end products. FDP- 
aldolase I is characteristic to eukaryotes, although it is found in various 

25 bacteria. FDP-aldolase II is more frequently encountered in prokaryotic 
organisms. If FDP-aldolase reacts with FDP in the presence of acetaldehyde, 
the latter compound can interact with DHAP to yield an undesired 
condensation by-product named deoxyxylolose 1 -phosphate (dX1P). Thus, 
the reaction is preferably conducted in a manner by which the generation 

30 of undesired side products is reduced or completely suppressed. 
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Particular preferred sources of native or recombinant FDP aldolases are 
prokaryotic or eukaryotic organisms. For example, FDP aldolase may be 
isolated from rabbit muscle. Further, FDP aldolase may be obtained from 
bacteria such as E.coli. Recombinant FDP aldolase may be isolated from 
recombinant E.coli strain pHSP 284 (CNCM 1-2190) which is transformed 
with a plasmid containing the E.coli fba (fructose diphosphate aldolase) 
insert. The nucleotide sequence of the E.coli FDP aldolase gene and the 
corresponding amino acid sequence are shown in SEQ ID NO. 9 and 10. 

On the other hand, GAP may be generated from DHAP and ATP, with 
dihydroxyacetone phosphate (DHAP) and ADP being formed and 
subsequent isomerization of DHAP to GAP in a reaction catalyzed by a 
glycerokinase (GK, EC 2.7.1 .30) and a triose phosphate isomerase (TIM, EC 
5.3.1 .1). Suitable glycerokinases are obtainable from E.coli, suitable triose 
phosphate isomerases are obtainable from bovine or porcine muscle. 

In a still further embodiment of the present invention GAP may be generated 
from glycerol phosphate (GP) and 0 2 , with DHAP and H 2 0 2 being formed 
and subsequent isomerization of DHAP to GAP in a reaction catalyzed by 
a glycerophosphate oxidase (GPO, EC 1.1.3.21) and a triose phosphate 
isomerase (TIM, EC 5.3.1.1). Suitable glycerophosphate oxidases are 
obtainable from Aerococcus viridans. 

In an alternative embodiment of the present invention deoxyribose 5- 
phosphate (dR5P) is generated by phosphorylation of deoxyribose. 
Preferably this reaction is carried out in the presence of a suitable enzyme, 
e.g. a deoxyribokinase (dRK, EC 2.7.1.5) which may be obtained from 
prokaryotic organisms, particularly Salmonella typhi and in the presence of 
ATP. The nucleotide sequence of the Salmonella dRK gene and the 
corresponding amino acid sequence are shown in SEQ ID N0.1 1 and 12. 
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By the reaction as outlined above deoxyribonucleosides are obtained which 
contain a nucleobase which is accepted by the enzymes TP and/or PNP. TP 
is specific for thymidine (T), uracil (U) and other related pyrimidine 
compounds. PNP uses adenine, guanine, hypoxanthine or other purine 
5 analogs as substrates. 

The synthesis of deoxyribonucleosides which are not obtainable by direct 
condensation such as deoxycytosine (dC), thus, require an additional 
enzymatic reaction, wherein a deoxyribonucleoside containing a first 
10 nucleobase is reacted with a second nucleobase, with a second 
ribonucleoside containing the second nucleobase being formed. The second 
nucleobase is preferably selected from cytidine and analogs thereof such as 

5- azacytidine. It should be noted, however, that also other nucleobases 
such as 6-methyl purine, 2-amino-6-methylmercaptopurine, 6- 

15 dimethylaminopurine, 2,6-dichloropurine, 6-chloroguanine # 6-chloropurine, 

6- azathymine, 5-fluorouracil, ethyl-4-amino-5-imidazol carboxylate, imidazole 
4-carboxamide and 1 ,2,4-triazoIe-3-carboxamide may be converted to the 
corresponding deoxyribonucleoside by this nucleobase exchange reaction 
(Beaussire and Pochet, Nucleosides & Nucleotides 14 (1995), 805-808, 

20 Pochet et aL, Bioorg.Med.Chem.Lett.5 (1995), 1679-1684, Pochet and 
Dugue, Nucleosides & Nucleotides 17 (1998), 2003-2009, Pistotnik et al.. 
Anal. Biochem. 271 (1999), 192*199). This reaction is preferably catalyzed 
by an enzyme called nucleoside 2-deoxyribosyltransferase (NdT, EC 
2.4.2.6) which transfers the glycosyl moiety from a first deoxynucleoside 

25 to a second nucleobase, e.g. cytosine. A preferred source of native or 
recombinant NdT are prokaryotic organisms such as lactobacilli, particularly 
Lactobacillus leichmannii. Recombinant NdT may be isolated from 
recombinant E.coli strain pHSP 292 (CNCM 1-2191) deposited on April 23, 
1 999, which is transformed with a plasmid containing the L.leichmannii NdT 

30 (nucleoside 2-deoxyribosyltransferase) insert. The nucleotide sequence of 
the NdT gene and the corresponding amino acid sequence are shown in 
SEQ ID NO. 13 and 14. 
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A further aspect of the present invention is a method for the in vitro 
enzymatic synthesis of deoxyribonucleosides comprising the steps of: (i) 
condensing glyceraldehyde 3-phosphate (GAP) with acetaldehyde to 
deoxyribose 5-phosphate (dR5P), (ii) isomerizing deoxyribose 5-phosphate 
to deoxyribose 1 -phosphate (dR1P) and (iii) reacting deoxyribose 1- 
phosphate and a nucteobase, wherein a deoxyribonucleoside and inorganic 
phosphate are formed. Preferably, the reaction is carried out without 
isolating intermediate products and, more preferably, as a one-pot reaction. 
Further, the removal of the inorganic phosphate from the reaction is 
preferred. 



As outlined above, the glyceraldehyde 3-phosphate may be generated from 
FDP, DHA and/or GP. Preferably, FDP is used as a starting material. 



15 In order to avoid the production of undesired by-products and the toxic 
effects of acetaldehyde, the course of the reaction is preferably controlled 
by suitable means. Thus, preferably, the reaction is carried out in a manner 
such that the acetaldehyde concentration in step (ii) is comparatively low, 
e.g. less than 100 mM, particularly less than 50 mM, e.g. by adding the 

20 acetaldehyde in portions or continuously during the course of the reaction 
and/or by removing excess acetaldehyde. Further, it is preferred that before 
step (ii) excess starting materials and/or by-products, particularly fructose 
1 ,6-diphosphate and/or deoxyxylulose 1-phosphate (dX1P), are removed. 
This removal may be effected by chemical and/or enzymatic methods, e.g. 

25 precipitating FDP with ferric salts or enzymatically degrading X1P via 
dihydroxyacetone phosphate. Alternatively or additionally the reaction 
conditions may be adjusted such that before step (ii) no substantial 
amounts, preferably less than 10 mM, of starting materials and/or by- 
products, particularly fructose 1 ,6-diphosphate and/or deoxyxylulose 1- 

30 phosphate, are present in the reaction mixture. 
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In still another embodiment, the present invention relates to a method for 
the in vitro enzymatic synthesis of deoxyribonucleosides comprising the 
steps of: (i) phosphorylating deoxyribose to deoxyribose 5-phosphate, (ii) 
isomerizing deoxyribose 5-phosphate to deoxyribose 1 -phosphate and (iii) 
5 reacting deoxyribose 1 -phosphate and nucleobase, wherein a 
deoxyribonucleoside and inorganic phosphate are formed. Preferably, these 
reactions are carried out with isolating intermediate products and, more 
preferably, as a one-pot reaction. To obtain a better yield the removal of 
inorganic phosphate from step (iii) is preferred. 

10 

By the process as described above naturally occurring deoxyribonucleosides 
such as dA, dG, dT, dU and dT but also analogs thereof containing non- 
naturally occurring nucleobases and/or non-naturally occurring deoxyribose 
sugars such as 2'-deoxy-3'-azido-deoxyribose or 2'-deoxy-4'-thio-deoxy- 
i5 ribose may be produced. 

The deoxyribonucleosides obtained may be converted to further products 
according to known methods. These further reaction steps may comprise 
the synthesis of deoxyribonucleoside mono-, di- or triphosphates, of H- 
20 phosphonates or phosphoramidites. Additionally or alternatively, labelling 
groups such as radioactive or chemical labelling groups may be introduced 
into the deoxyribonucleosides. 

Still a further aspect of the present invention is the use of an isolated 
25 nucleic acid molecule encoding a nucleoside 2-deoxyribosyl transferase 
(NdT, EC 2.4.2.6) for the preparation of an enzyme in an in vitro enzymatic 
synthesis process, wherein a deoxyribonucleoside containing a first 
nucleobase is reacted with a second nucleobase, with a 
deoxyribonucleoside containing the second nucleobase being formed. The 
30 second nucleobase is preferably selected from cytidine and analogs thereof, 
2,6-dichloro-purine, 6-chloro-guanine, 6-chloro-purin , 6-aza-thymine and 
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5-fluoro-uracil. The first nucleobase is preferably selected from thymine, 
guanine, adenine or uracil. 

More preferably, the nucleic acid molecule encoding an NdT comprises (a) 
5 the nucleotide sequence shown in SEQ ID NO. 13 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence of (a) 
in the scope of degeneracy of the genetic code or (c) the nucleotide 
sequence hybridizing under stringent conditions to the sequence (a) and/or 
(b). Apart from the sequence of SEQ ID NO. 13 the present invention also 
o covers nucleotide sequences coding for the same polypeptide, i.e. they 
correspond to the sequence within the scope of degeneracy of the genetic 
code, and nucleotide sequence hybridizing with one of the above-mentioned 
sequences under stringent conditions. These nucleotide sequences are 
obtainable from SEQ ID NO. 13 by recombinant DNA and mutagenesis 
is techniques or from natural sources, e.g. from other Lactobacillus strains. 

Stringent hybridization conditions in the sense of the present invention are 
defined as those described by Sambrook et al., Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press (1989), 1.101- 
20 1.104. According to this, hybridization under stringent conditions means 
that a positive hybridization signal is still observed after washing for one 
hour with 1 x SSC buffer and 0.1 % SDS at 55 °C, preferably at 62°C and 
most preferred at 68 °C, in particular, for one hour in 0.2 x SSC buffer and 
0.1% SDS at 55°C, preferably at 62°C and most preferred at 68°C. 

25 

Moreover, the present invention also covers nucleotide sequences which, 
on nucleotide level, have an identity of at least 70%, particularly preferred 
at least 80% and most preferred at least 90% to the nucleotide sequence 
shown in SEQ ID NO. 13. Percent identity are determined according to the 
30 following equation: 
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I = x 100 

L 

5 wherein I are percent identity, L is the length of the basic sequence and n 
is the number of nucleotide or amino acid difference of a sequence to the 
basic sequence. 

Still another subject matter of the present invention is a recombinant vector 
io comprising at least one copy of the nucleic acid molecule as defined above, 
operatively linked with an expression control sequence. The vector may be 
any prokaryotic or eukaryotic vector. Examples of prokaryotic vectors are 
chromosomal vectors such as bacteriophages (e.g. bacteriophage Lambda) 
and extrachromosomal vectors such as plasmids (see, for example, 
15 Sambrook et aL, supra, Chapter 1-4). The vector may also be a eukaryotic 
vector, e.g. a yeast vector or a vector suitable for higher cells, e.g. a 
plasmaid vector, viral vector or plant vector. Suitable eukaryotic vectors are 
described, for example, by Sambrook et aL, supra, Chapter 16. The 
invention moreover relates to a recombinant cell transformed with the 
20 nucleic acid or the recombinant vector as described above. The cell may be 
any cell, e.g. a prokaryotic or eukaryotic cell. Prokaryotic cells, in particular, 
E.coli cells, are especially preferred. 

The invention refers to an isolated polypeptide having NdT activity encoded 
25 by the above-described nucleic acid and its use for the preparation of 
deoxyribonucleosides. Preferably, the polypeptide has the amino acid 
sequence shown in SEQ ID NO. 14 or an amino acid sequence which is at 
least 70%, particulary preferred at least 80% and most preferred at least 
90% identical thereto, wherein the identity may be determined as described 
30 above. 

Finally, the present invention also relates to the use of isolated nuclecic acid 
molecules having thymidine phosphorylase (TP), purine nucleoside 
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phosphorylase (PNP), phosphopentose mutase (PPM), phosphopentose 
aldolase (PPA), FDP aldolase and deoxyribokinase (dRK) activity for the 
preparation of an enzyme for a method for the in vitro synthesis of 
deoxynucleosides. Preferably, these nucleic acids are selected (a) from a 
5 nucleotide sequence shown in SEQ ID NO.1, 3, 5, 7, 9 or 1 1 or their 
complementary sequences, (b) a nucleotide sequence corresponding to a 
sequence of (a) within the scope of degeneracy of the genetic code or (c) 
a nucleotide sequence hybridizing under stringent conditions to a sequence 
(a) and/or (b). 

10 

Isolated polypeptides having TP, PNP, PPM, PPA, FDP aldolase or dRK 
activity encoded by the above-described nucleic acids may be used for the 
preparation of deoxyribonucleosides. Preferably, these polypeptides have 
the amino acid sequence shown in SEQ ID NO. 2, 4, 6, 8, 10 or 12 or an 
is amino acid sequence which is at least 70%, particulary preferred at least 
80% and most preferred at least 90% identical thereto, wherein the identity 
may be determined as described above. 

An isolated nucleic acid molecule encoding a dRK may be used for the 
20 preparation of an enzyme for an in vitro method for the enzymatic synthesis 
of deoxyribonucleosides comprising the step of phosphorylating deoxyribose 
to deoxyribose 5-phosphate, wherein said nucleic acid molecule comprises 
(a) the nucleotide sequence shown in SEQ ID N0.1 1 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence of (a) 
25 in the scope of the degeneracy of the genetic code or (c) a nucleotide 
sequence hybridizing under stringent conditions to the sequence of (a) 
and/or (b). Correspondingly, an isolated polypeptide having dRK activity is 
suitable for an in vitro method for the enzymatic synthesis of 
deoxyribonucleosides as outlined above. 

30 



•08-1999 



The E.coli strains pHSP 282 (CNCM 1-2186), pHSP 283 {CNCM 1-2187), 
pHSP 275 (CNCM 1-2188), pHSP 276 (CNCM 2189), pHSP 284 (CNCM I- 
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2190) and pHSP 292 (CNCM 1-2191) were deposited according to the 
regulations of the Budapest Treaty on April 23, 1999 at the Collection 
Nationale de Culture de Microorganismes, Institut Pasteur, 25, Rue de 
Docteur Roux, 75724 Paris Cedex 15. 

5 

Example 1 

Sources of Enzymes 

L-glycerol 3-phosphate oxidase (1.1.3.21) from Aerococcus viridans, 
10 sucrose phosphorylase (2.4.1.7), fructose 6-phosphate kinase (2.7.1.90) 
from Propionibacterium freudenreichii, rabbit muscle aldolase (RAMA), 
formate dehydrogenase, glycerolphosphate dehydrogenase (GDH), 
triosephosphate isomerase (TIM), catalase, glycerol 3-phosphate oxidase 
and maltose phosphorylase were obtained from commercial sources (Roche 
15 Diagnostics, Sigma) or as described in the literature. 

FDP aldolase II (4.1.2.13), phosphopentose aldolase (PPA, EC 4.1.2.4), 
phosphopentose mutase (PPM, EC 5.4.2.7), thymidine phosphorylase (TP, 
EC 2.4.2.4), purine nucleoside phosphorylase (PNP, EC 2.4.2. 1 ), nucleoside 
20 2-deoxyribosyl transferase (NdT, EC 2.4.2.6) were obtained from E.coli 
strains deposited at CNCM (see above). 

Example 2 

25 Protocol of the synthesis of deoxyadenosine 

Reaction mixture A was prepared by adding acetaldehyde (final 
concentration 250 mM), FDP aldolase II (0.5 U/ml), PPA (2.5 U/ml) to 20 
ml of 100 mM fructose-1 ,6-diphosphate (FDP), pH 7.6 and incubating 
30 overnight at 4°C. 



-08-1999 



I EP9911 6425.2 



DESC 



- 16- 

Mixture B was prepared by adding MnCI 2 (final concentration 0.6 mM), 
glucose 1,6-diphosphate {1 5/;M), PPM (1.5 U/ml), PNP (0.4 U/ml), SP (1.5 
U/rnl) pentosephosphate aldolase, PPA (2 U/ml) and FDP aldolase il (0.5 
U/ml) to 10 ml 0.9 M sucrose, pH 7.6, at room temperature. 



10 



15 



2 ml of A were added over B at a temperature of 20°C. After 1 hour 2.5 ml 
A were added. After another hour 3.0 ml A were added. After another 1.5 
h 3.5 ml A were added. After another 1 .5 h 4 ml A were added and after 
another 1-1.5 h 5 ml A were added and left to stand overnight. 

At each time of addition of A the amounts of FDP, dR5P, dX1P and dA in 
the reaction mixture were determined and the yield was calculated. The 
concentration of acetaldehyde was kept between 20-30 mM. The results 
are shown in Table 1 : 

Table 1 



20 



25 



Time Volume 
(h) (ml) 


Concentrations (mM) 


Yield (mmol) 


dR5P dA dX1P 


dA 


0 12 


4 0 1.2 


0 


1 12 


3.4 3.2 1 


0.04 


2 14.5 


7.9 8.0 2.6 


0.12 


3.5 17.5 


13 16.2 4.3 


0.28 


5 21 


11.7 21.7 


0.46 


6 25 


23.7 


0.59 


22 30 


11 40.4 13.2 


1.21 


30 30 


50.3 


1.51 


54 30 

= 


8.9 60.6 


1.82 



;.■.•.^v^•.^•.-.^•. x .^^ : .^-.^•.•.^ : .-.^•.^:.:. : <. : . : . : .v.^:.:.:.^ : . : o:•:.: : ^:v•:-. 
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The starting amount of FDP was 1.92 mmol. The amount after completion 
of reaction was 0.150 mmol. Thus, 1.77 mmol were consumed, 
theoretically corresponding to 3. 54 mmol equivalents dA. The amount of dA 
formed was 1.82 mmol, leading to a yield of 51.4% based on the amount 
5 of FDP. 

Example 3 

Removal of excess FDP by means of FeCI 3 

10 

1.4 g (2.55 mmol) trisodium-fructose-1 # 6-disphosphate-octahydrate and 
430 jj\ (335 mg, 7.6 mmol) acetaldehyde were dissolved in 15 ml of water 
at 4°C. A pH of 7.9 was adjusted by means of sodium hydroxide solution. 
1 50 U pentosephosphate aldolase (PPA) were added, and cold water (4°C) 

15 was added to give 20 ml. After addition of 50 U E.coli aldolase II the 
mixture was stored at 4°C. After 2 h another 75 U PPA and 50 jjI 
acetaldehyde (390 mg, 8.9 mmol) were added. After 20 h 500 U 
triosephosphate isomerase (TIM) were added. After 120 h the solution 
contained about 68 mM FDP, about 1 2 mM dX1 P and about 45 mM dR5P. 

20 The reaction was stopped by adding 900 jj\ of a 2 M solution of iron(lll) 
chloride in 0.01 M hydrochloric acid. The precipitate was centrifuged and 
washed, the resulting solution contained about 4 mM dX1P, about 9 mM 
FDP and about 25 mM dR5P. 

25 Example 4 

Removal of excess FDP and dX1P by degradation via DHAP 

576mg(1 .05mmol)trisodium-fructose-1 ,6-disphosphate-octahydrate were 
30 dissolved in 8 ml water, and the pH was adjusted at 8.1 by means of 
sodium hydroxide solution. 75 U PPA and 27 U rabbit muscle aldolase 
(RAMA) were added, and water was added to give 1 0 ml. 570 //I (440 mg, 
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10 mmol) acetaldehyde were added. The reaction was stored at 4°C. After 
100 h the solution contained about 1 10 mM dXlP, about 5 mM FDP and 
about 85 mM dR5P (about 870//mol). The reaction was stopped by adding 
hydrochloric acid until a pH of 2 was reached. After adding sodium 
hydroxide solution to give a pH of 5.5 the solution was stored. 

For removing dX1 P the acetaldehyde was evaporated and the solution was 
diluted with water to reach 30 ml. It was mixed with 3 ml 2.65 M sodium 
formate solution (8 mmol), and sodium hydroxide solution was added until 
a pH of 7.4 was reached. 23 U formate dehydrogenase (FDH), 6 mg NADH, 
16 U RAMA and 20 U glycerolphosphate dehydrogenase (GDH) were 
added. 



15 



After 24 h at room temperature the concentrations of dX1P and FDP are 
below 3 mM, the loss of dR5P is less than 10%. 



Example 5 



20 



25 



Preparation of dR5P via G3P 

1.1 g (2.0 mmol) trisodium-fructose-1,6-disphosphate-octahydrate were 
dissolved in 8 ml water. 1 ,58 mol of a 2.65 M sodium formate solution {4.2 
mmol) and 14.2 mg NADH were added. A pH of 7.0 was adjusted by 
means of NaOH. After addition of 36 U RAMA, 50 U triosephosphate 
isomerase (TIM), 34 U GDH and 35 U FDH water was added to give 1 2 ml. 



30 



After incubation of 40 h at room temperature the FDP content was below 

3 mM. The enzymes were denatured by acidification with hydrochloric acid 
to reach a pH of 2. Subsequently, the pH of the solution was adjusted at 

4 and the solids were centrifuged and filtered off, respectively. Through 
dilution during purification a total volume of 25 ml was reached which 
contained about 160 mM of glycerol-3-phosphate (G3P). 
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4 ml of this solution (about 640 //mol G3P) were adjusted at a pH of 7.8 by 
means of sodium hydroxide solution. 7.8 kU catalase, 500 U TIM and 13 
U glycerol 3-phosphate oxidase are added. The mixture was stirred very 
slowly in an open flask. After 30 min 1 8 U PPA were added, Acetaldehyde 

5 was added in portions of 30 jj\ (23.5 mg, 530 //mol) after 30, 60, 1 20, 1 80 
and 240 min. After 24 h another 15 U PPA, 2.5 kU TIM and 100 //I (78 mg, 
1.8 mmol) acetaldehyde were added. After 30 h the batch is sealed after 
addition of another 100 //I acetaldehyde. After a total of 45 h a 
concentration of about 60 mM dR5P was achieved and the reaction is 

10 completed. For preparing 2'-deoxyadenosine (e.g. Example 7) excess 
acetaldehyde must be distilled off. 

Example 6 

15 Preparation of a dR5P solution containing small amounts of dX1P or FDP 

A solution of 60 mmol/I FDP and 120 mmol/l acetaldehyde having pH 7.4 
was kept at a temperature of 15°C. 5 ml thereof were mixed with 4 U 
aldolase II, 2 U TIM and 40 U PPA and kept at 15°C. After 4, 8.5, 16.5 

20 and 24 h 12 U PPA and 100 //I of a 34 vol.-% solution of acetaldehyde in 
water (26.4 mg, 600 //mol) were added each. After 40 h the solution was 
allowed to reach room temperature. After 90 h the reaction solution had 
reached concentrations of about 3 mM FDP, about 4 mM dX1 P and at least 
70 mM dR5P. For stopping the reaction and removing acetaldehyde about 

25 20% of the volume were distilled off. 

Example 7 

Preparation of deoxyadenosine (dA) from dR5P by means of barium acetate 

30 

dR5P was used in the form of a solution prepared according to Examples 3- 
6. For instance, 10 ml of a solution of Example 6 diluted to have 70 mM 
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dR5P (700 //mol dR5P) were mixed with 40 mg (300 //mol) adenine, 41 //g 
(50 nmol) tetracyclohexyIammonium-glucose-1,6-disphosphate, 396 //g (2 
//mol) manganese-ll-acetate-tetrahydrate, 10 U pentosephosphate mutase 
(PPM) and 30 U purine-nucleoside phosphorylase (PNP). After 3 h another 
27 mg (200 //mol) adenine and 26 mg (100 //mol) barium acetate were 
added. 



10 



A further amount of 26 mg barium acetate was added after 4 h, one of 40 
mg adenine after 7 h. After 10 h the reaction was completed. The solution 
had a concentration of 45 mM dA. 



Example 8 

Preparation of deoxyadenosine (dA) from dR5P by means of sucrose 
15 phosphorylase 

10 ml of a solution of Example 6 diluted to 55 mM dR5P (550 //mol dR5P) 
were mixed with 81 mg (600 //mol) adenine, 41 fjg (50 nmol) 
tetracyclohexylammonium-glucose-l^-disphosphate, 396 //g (2 //mol) 
20 manganese-II-acetate-tetrahydrate, 10 U pentosephosphate mutase (PPM) 
1 5 U purine nucleoside phosphorylase (PNP) f 25 U sucrose phosphorylase 
and 340 mg (1 mmol) cane sugar. 

After 3 h at room temperature the reaction was completed. The solution 
25 had a concentration of about 50 mM dA. 



Example 9 



30 



Preparation of deoxyadenosine (dA) from dR5P by means of maltose 
phosphorylase 
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10 ml of a solution of dR5P diluted to 55 mM were mixed at pH 7.0 with 
81 mg (600/ymol) adenine, 41 jjg (50 nmoles) glucose 1 ,6-diphosphate, 
396 /yg (2 //moles) manganese ll-acetate tetrahydrate, 5 units pentose 
phosphate mutase (PPM), 1 0 units purine nucleoside phosphorylase, (PNP), 
5 20 units maltose phosphorylase and 1080 mg (3 mmoles) maltose. 



After 12h at room temperature the reaction was completed. The solution 
had a concentration of 49 mM dA. 



10 Example 10 

Preparation of deoxycytosine (dC) from dR5P by means of sucrose 
phosphorylase 



is 20 ml of a solution of dR5P diluted to 70 mM were mixed at pH 7.0 with 
5.4 mg adenine (0.04 mmoles), 1 55 mg cytosine (1.4 mmoles), 82 jjg (100 
nmoles) glucose 1 ,6-diphosphate, 792 //g (4//moles) manganese ll-acetate- 
tetrahydrate, 20 units PPM, 30 units PNP, 50 units 2-deoxyribosyl 
transferase (NdT), 50 units sucrose phosphorylase and 2.05 g (6 mmoles) 

20 sucrose. 

After 18h at 30°C the solution had a concentration of 62 mM dC. 



Example 11 

25 

Preparation of deoxyguanosine (dG) from dR5P by means of sucrose 
phosphorylase 



30 



20 ml of a solution of dR5P diluted to 70 mM were mixed at pH 7.0 with 
91 mg guanine (0.6 mmoles), 82 /jq (100 nmoles) glucose 1,6- 
diphosphate, 792/vg (4//moles) manganese ll-acetate-tetrahydrate, 20 units 
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PPM, 1 0 units PNP, 20 units sucrose phosphorylase and 2.05 g (6 mmoles) 
sucrose. 



After 18h at 37°C the dG formed corresponds to 0.5 mmoles. 
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SEQUENCE LISTING 



5 <110> Institut Pasteur 

Roche Diagnostics GmbH 
Pharma Waldhof GmbH 

<120> Enzymatic synthesis of deoxyribonucleosides 

10 

<130> 20373pep deoxyribonucleosides 

<140> 
<141> 

15 

<160> 14 

<170> Patentln Ver. 2.1 

20 <210> 1 
<211> 1323 
<212> DNA 

<213> Escherichia coli 

25 <220> 

<221> CDS . 

<222> (1) . . (1320) 

<400> 1 

30 ttg ttt etc gca caa gaa att att cgt aaa aaa cgt gat ggt cat gcg 48 

Leu Phe Leu Ala Gin Glu lie lie Arg Lys Lys Arg Asp Gly His Ala 
15 10 15 

ctg age gat gaa gaa att cgt ttc ttt ate aac ggt att cgc gac aac 96 
35 Leu Ser Asp Glu Glu lie Arg Phe Phe lie Asn Gly He Arg Asp Asn 
20 25 30 

act ate tec gaa ggg cag att gee gee etc gcg atg ace att ttc ttc 144 
Thr He Ser Glu Gly Gin He Ala Ala Leu Ala Met Thr He Phe Phe 
40 35 40 45 



cac gat atg aca atg cct gag cgt gtc teg ctg ace atg gcg atg cga 
Hio Asp Met Thr Met Pro Glu Arg Val Ser Leu Thr Met Ala Met Arg 
50 55 60 



192 
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gat tea gga acc gtt etc gac tgg aaa age ctg cat ctg aat ggc ccg 
Asp Ser Gly Thr Val Leu Asp Trp Lys Ser Leu His Leu Asn Gly Pro 
65 70 75 80 



240 



5 att gtt gat aaa cac tec acc ggt ggc gtc ggc gat gtg act teg ctg 
lie Val Asp Lys His Ser Thr Gly Gly Val Gly Asp Val Thr Ser Leu 
85 90 95 



288 



• atg ttg ggg ccg atg gtc gca gec tgc ggc ggc tat att ccg atg ate 
10 Met Leu Gly Pro Met Val Ala Ala Cys Gly Gly Tyr He Pro Met He 
100 105 110 



336 



tct ggt cgc ggc etc ggt cat act ggc ggt acg etc gac aaa ctg gaa 
Ser Gly Arg Gly Leu Gly His Thr Gly Gly Thr Leu Asp Lys Leu Glu 
15 115 120 125 



384 



20 



tec ate cct ggc ttc gac att ttc ccg gat gac aac cgt ttc cgc gaa 
Ser He Pro Gly Phe Asp He Phe Pro Asp Asp Asn Arg Phe Arg Glu 
130 135 140 

att att aaa gac gtc ggc gtg gcg att ate ggt cag acc agt tea ctg 
He He Lys Asp Val Gly Val Ala He He Gly Gin Thr Ser Ser Leu 
145 150 155 160 



432 



480 



25 get ccg get gat aaa cgt ttc tac gcg acc cgt gat att acc gca acc 
Ala Pro Ala Asp Lys Arg Phe Tyr Ala Thr Arg Asp He Thr Ala Thr 
165 170 175 



528 



gtg gac tec ate ccg ctg ate acc gec tct att ctg gcg aag aaa ctt 
30 Val Asp Ser He Pro Leu He Thr Ala Ser He Leu Ala Lys Lys Leu 
180 185 190 



576 



gcg gaa ggt ctg gac gcg ctg gtg atg gac gtg aaa gtg ggt age ggc 
Ala Glu Gly Leu Asp Ala Leu Val Met Asp Val Lys Val Gly Ser Gly 
35 195 200 205 



624 



40 



gcg ttt atg ccg acc tac gaa etc tct gaa gec ctt gec gaa gcg att 
Ala Phe Met Pro Thr Tyr Glu Leu Ser Glu Ala Leu Ala Glu Ala He 
210 215 220 

gtt ggc gtg get aac ggc get ggc gtg cgc acc acc gcg ctg etc acc 
Val Gly Val Ala Asn Gly Ala Gly Val Arg Thr Thr Ala Leu Leu Thr 
225 230 235 240 



672 



720 



45 gac atg aat cag gta ctg gee tec agt gca ggt aac gcg gtt gaa gtt 



768 



s 
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Asp Met Asn Gin Val Leu Ala Ser Ser Ala Gly Asn Ala Val Glu Val 
245 250 255 

cgt gaa gcg gtg cag ttc ctg acg ggt gaa tat cgt aac ccg cgt ctg 816 
5 Arg Glu Ala Val Gin Phe Leu Thr Gly Glu Tyr Arg Asn Pro Arg Leu 
260 265 270 



ttt gat gtc acg atg 
Phe Asp Val Thr Met 
10 275 

ctg gcg aaa gat gac 
Leu Ala Lys Asp Asp 
290 

15 

gac aac ggt aaa gcg 
Asp Asn Gly Lys Ala 
305 

20 aaa ggc ccg acc gac 
Lys Gly Pro Thr Asp 
325 

gcg atg ctg acg aaa 
25 Ala Met Leu Thr Lys 
340 

gaa atg gat acc cgc 
Glu Met Asp Thr Arg 
30 355 

gga cgc cgt cag gca 
Gly Arg Arg Gin Ala 

370 

35 

gat atg gcg cgt ctg 
Asp Met Ala Arg Leu 
385 

40 gtt ate cac gcg aaa 
Val lie His Ala Lys 
405 

gtg aaa gcg gca att 
45 Val Lys Ala Ala lie 



gcg ctg tgc gtg gag 
Ala Leu Cys Val Glu 
280 

gec gaa gcg cgc gcg 
Ala Glu Ala Arg Ala 
295 

gca gaa gtc ttt ggt 
Ala Glu Val Phe Gly 
310 

ttc gtt gag aac tac 
Phe Val Glu Asn Tyr 
330 

gca gtc tat get gat 
Ala Val Tyr Ala Asp 
345 

gcg ctg ggg atg gca 
Ala Leu Gly Met Ala 
360 

tct gac acc ate gat 
Ser Asp Thr lie Asp 
375 

ggc gac cag gta gac 
Gly Asp Gin Val Asp 
390 

gac gaa aac aac tgg 
Asp Glu Asn Asn Trp 
410 

aaa ctt gec gat aaa 
Lys Leu Ala Asp Lys 



atg ctg ate tec ggc aaa 
Met Leu lie Ser Gly Lys 
285 

aaa ttg cag gcg gtg ctg 
Lys Leu Gin Ala Val Leu 
300 

cgt atg gta gcg gca caa 
Arg Met Val Ala Ala Gin 
315 320 

gcg aag tat ctg ccg aca 
Ala Lys Tyr Leu Pro Thr 
335 

acc gaa ggt ttt gtc agt 
Thr Glu Gly Phe Val Ser 
350 

gtg gtt gca atg ggc ggc 
Val Val Ala Met Gly Gly 
365 

tac age gtc ggc ttt act 
Tyr Ser Val Gly Phe Thr 
380 

ggt cag cgt ccg ctg gcg 
Gly Gin Arg Pro Leu Ala 
395 400 

cag gaa gcg gcg aaa gcg 
Gin Glu Ala Ala Lys Ala 
415 

gca ccg gaa age aca cca 
Ala Pro Glu Ser Thr Pro 



864 



912 

i 

960 



1008 



1056 



1104 

i 

1152 



1200 



1248 



1296 



-08-1999 



• 



DESC 



26 - 



420 



425 



430 



act gtc tat cgc cgt ate age gaa taa 
Thr Val Tyr Arg Arg lie Ser Glu 
5 435 440 



1323 



<210> 2 
<211> 440 
10 <212> PRT 

<213> Escherichia coli 

<400> 2 

Leu Phe Leu Ala Gin Glu He He Arg Lys Lys Arg Asp Gly His Ala 
15 1 5 10 15 

Leu Ser Asp Glu Glu He Arg Phe Phe He Asn Gly He Arg Asp Asn 
20 25 30 

20 Thr He Ser Glu Gly Gin He Ala Ala Leu Ala Met Thr He Phe Phe 
35 40 45 



25 



His Asp Met Thr Met Pro Glu Arg Val Ser Leu Thr Met Ala Met Arg 
50 55 60 

Asp Ser Gly Thr Val Leu Asp Trp Lys Ser Leu His Leu Asn Gly Pro 
65 70 75 80 



30 



He Val Asp Lys His Ser Thr Gly Gly Val Gly Asp Val Thr Ser Leu 
85 90 95 



Met Leu Gly Pro Met Val Ala Ala Cys Gly Gly Tyr He Pro Met He 
100 105 HO 

35 Ser Gly Arg Gly Leu Gly His Thr Gly Gly Thr Leu Asp Lys Leu Glu 
115 120 125 



40 



Ser He Pro Gly Phe Asp He Phe Pro Asp Asp Asn Arg Phe Arg Glu 
130 135 140 

He He Lys Asp Val Gly Val Ala He He Gly Gin Thr Ser Ser Leu 
145 150 155 160 



Ala Pro Ala Asp Lys Arg Phe Tyr Ala Thr Arg Asp He Thr Ala Thr 
45 165 170 175 
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Val Asp Ser He Pro Leu He Thr Ala Ser He Leu Ala Lys Lys Leu 
180 185 190 

Ala Glu Gly Leu Asp Ala Leu Val Met Asp Val Lys Val Gly Ser Gly 
5 195 200 205 

Ala Phe Met Pro Thr Tyr Glu Leu Ser Glu Ala Leu Ala Glu Ala He 
210 215 220 

10 Val Gly Val Ala Asn Gly Ala Gly Val Arg Thr Thr Ala Leu Leu Thr 
225 230 235 240 

Asp Met Asn Gin Val Leu Ala Ser Ser Ala Gly Asn Ala Val Glu Val 
245 250 255 

15 

Arg Glu Ala Val Gin Phe Leu Thr Gly Glu Tyr Arg Asn Pro Arg Leu 
260 265 270 

Phe Asp Val Thr Met Ala Leu Cys Val Glu Met Leu He Ser Gly Lys 
20 275 280 285 

Leu Ala Lys Asp Asp Ala Glu Ala Arg Ala Lys Leu Gin Ala Val Leu 
290 ' 295 300 

25 Asp Asn Gly Lys Ala Ala Glu Val Phe Gly Arg Met Val Ala Ala Gin 
305 310 315 320 

Lys Gly Pro Thr Asp Phe Val Glu Asn Tyr Ala Lys Tyr Leu Pro Thr 
325 330 335 

30 

Ala Met Leu Thr Lys Ala Val Tyr Ala Asp Thr Glu Gly Phe Val Ser 
340 345 350 

Glu Met Asp Thr Arg Ala Leu Gly Met Ala Val Val Ala Met Gly Gly 
35 355 360 365 

Gly Arg Arg Gin Ala Ser Asp Thr He Asp Tyr Ser Val Gly Phe Thr 
370 375 380 

40 Asp Met Ala Arg Leu Gly Asp Gin Val Asp Gly Gin Arg Pro Leu Ala 
385 390 395 400 

Val He His Ala Lys Asp Glu Aon Asn Trp Gin Glu Ala Ala Lys Ala 
405 410 415 

45 
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Val Lys Ala Ala lie Lys Leu Ala Asp Lys Ala Pro Glu Ser Thr Pro 
420 425 430 



Thr Val Tyr Arg Arg lie Ser Glu 
5 435 440 



<210> 3 
10 <211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 
15 <221> CDS 

<222> (1) . . (717) 



<400> 3 

atg get acc cca cac att aat gca gaa atg ggc gat ttc get gac gta 

20 Met Ala Thr Pro His lie Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
15 10 15 



48 



gtt ttg atg cca ggc gac ccg ctg cgt gcg aag tat att get gaa act 96 
Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr lie Ala Glu Thr 
25 20 25 30 



30 



ttc ctt gaa gat gec cgt gaa gtg aac aac gtt cgc ggt atg ctg ggc 144 
Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 

ttc acc ggt act tac aaa ggc cgc aaa att tec gta atg ggt cac ggt 192 
Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
50 55 60 



35 atg ggt ate ccg tec tgc tec ate tac acc aaa gaa ctg ate acc gat 

Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 

65 70 75 80 

ttc ggc gtg aag aaa att ate cgc gtg ggt tec tgt ggc gca gtt ctg 

40 Phe Gly Val Lys Lys lie lie Arg Val Gly Ser Cys Gly Ala Val Leu 

85 90 95 



240 



288 



ccg cac gta aaa ctg cgc gac gtc gtt ate ggt atg ggt gee tgc acc 
Pro His Val Lys Leu Arg Asp Val Val lie Gly Met Gly Ala Cys Thr 
45 100 105 110 



336 



20-08-1999 EP9911 6425.2 ^ DESG 



-29- 

gat tec aaa gtt aac cgc ate cgt ttt aaa gac cat gac ttt gee get 3 84 
Asp Ser Lys Val Asn Arg lie Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

5 ate get gac ttc gac atg gtg cgt aac gca gta gat gca get aaa gca 432 
lie Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 
130 135 140 

ctg ggt att gat get cgc gtg ggt aac ctg ttc tec get gac ctg ttc 480 
10 Leu Gly He Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 
145 150 155 160 

tac tct ccg gac ggc gaa atg ttc gac gtg atg gaa aaa tac ggc att 528 
Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly He 
15 165 170 175 

etc ggc gtg gaa atg gaa gcg get ggt ate tac ggc gtc get gca gaa 576 

Leu Gly Val Glu Met Glu Ala Ala Gly He Tyr Gly Val Ala Ala Glu 

180 185 190 

20 

ttt ggc gcg aaa gee ctg ace ate tgc ace gta tct gac cac ate cgc 624 

Phe Gly Ala Lys Ala Leu Thr He Cys Thr Val Ser Asp His He Arg 

195 200 205 

25 act cac gag cag ace act gee get gag cgt cag act acc ttc aac gac 672 
Thr His Glu Gin Thr Thr Ala Ala Glu Arg Gin Thr Thr Phe Asn Asp 
210 215 220 

atg ate aaa ate gca ctg gaa tec gtt ctg ctg ggc gat aaa gag taa 720 
30 Met He Lys He Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu 
225 230 235 



<210> 4 
35 <211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 4 

40 Met Ala Thr Pro His He Asn Ala Glu Met Gly Asp Phe Ala Asp Val 
15 10 15 



Val Leu Met Pro Gly Asp Pro Leu Arg 
20 25 

45 



Ala Lys Tyr He 



Ala Glu Thr 
30 



.-08-1999 



EP991 16425.2 



- 30- 

Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 

Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
5 50 55 60 

Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 
65 70 75 80 

10 Phe Gly Val Lys Lys lie lie Arg Val Gly Ser Cys Gly Ala Val Leu 

85 90 95 

Pro His Val Lys Leu Arg Asp Val Val He Gly Met Gly Ala Cys Thr 
100 105 110 

15 

Asp Ser Lys Val Asn Arg He Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

He Ala Asp Phe Asp Met Val Arg Asn Ala Val Asp Ala Ala Lys Ala 
20 130 135 140 

Leu Gly He Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 
145 150 155 160 

25 Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly He 

165 170 175 

Leu Gly Val Glu Met Glu Ala Ala Gly He Tyr Gly Val Ala Ala Glu 
180 185 190 

30 

Phe Gly Ala Lys Ala Leu Thr He Cys Thr Val Ser Asp His He Arg 
195 200 205 

Thr His Glu Gin Thr Thr Ala Ala Glu Arg Gin Thr Thr Phe Asn Asp 
35 210 215 220 

Met He Lys He Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu 
225 230 235 



40 



<210> 5 
<211> 1224 
<212> DNA 
45<213> Escherichia coli 
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EP99 116425.2 



liii 



- 31 - 

<220> 

<221> CDS 

<222> (1) . . (1221) 

5 <400> 5 

atg aaa cgt gca ttt att atg gtg ctg gac tea ttc ggc ate ggc get 48 

Met Lys Arg Ala Phe lie Met Val Leu Asp Ser Phe Gly lie Gly Ala 

1 5 -10 15 

10aca gaa gat gca gaa cgc ttt ggt gac gtc ggg get gac acc ctg ggt 96 
Thr Glu Asp Ala Glu Arg Phe Gly Asp Val Gly Ala Asp Thr Leu Gly 
20 25 30 

cat ate gca gaa get tgt gee aaa ggc gaa get gat aac ggt cgt aaa 144 
15 His lie Ala Glu Ala Cys Ala Lys Gly Glu Ala Asp Asn Gly Arg Lys 
35 40 45 

ggc ccg etc aat ctg cca aat ctg acc cgt ctg ggg ctg gcg aaa gca 192 
Gly Pro Leu Asn Leu Pro Asn Leu Thr Arg Leu Gly Leu Ala Lys Ala 
20 50 55 60 

cac gaa ggt tct acc ggt ttc att ccg gcg gga atg gac ggc aac get 240 
His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 
65 70 75 80 

25 

gaa gtt ate ggc gcg tac gca tgg gcg cac gaa atg tea tec ggt aaa 288 
Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 

30 gat acc ccg tct ggt cac tgg gaa att gec ggt gtc ccg gtt ctg ttt 336 
Asp Thr Pro Ser Gly His Trp Glu lie Ala Gly Val Pro Val Leu Phe 
100 105 110 

gag tgg gga tat ttc tec gat cac gaa aac age ttc ccg caa gag ctg 384 
35 Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 
115 120 125 

ctg gat aaa ctg gtc gaa cgc get aat ctg ccg ggt tac etc ggt aac 432 
Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
40 130 135 140 

tgc cac tct tec ggt acg gtc att ctg gat caa ctg ggc gaa gag cac 480 

Cyo His Ser Ser Gly Thr Val lie Leu Asp Gin Leu Gly Glu Glu His 

145 150 155 160 

45 
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atg aaa acc ggc aag ccg att ttc tat acc tec get gac tec gtg ttc 528 
Met Lys Thr Gly Lys Pro lie Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

5 cag att gec tgc cat gaa gaa act ttc ggt ctg gat aaa etc tac gaa 576 
Gin lie Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 

ctg tgc gaa ate gec cgt gaa gag ctg acc aac ggc ggc tac aat ate 624 
10 Leu Cys Glu lie Ala Arg Glu Glu Leu Thr Asn Gly Gly Tyr Asn lie 
195 200 205 

ggt cgt gtt ate get cgt ccg ttt ate ggc gac aaa gec ggt aac ttc 672 
Gly Arg Val lie Ala Arg Pro Phe lie Gly Asp Lys Ala Gly Asn Phe 
15 210 215 220 

cag cgt acc ggt aac cgt cac gac ctg get gtt gag ccg cca gca ccg 720 
Gin Arg Thr Gly Asn Arg His Asp Leu Ala Val Glu Pro Pro Ala Pro 
225 230 235 240 

20 

acc gtg ctg cag aaa ctg gtt gat gaa aaa cac ggc cag gtg gtt tct 768 
Thr Val Leu Gin Lys Leu Val Asp Glu Lys His Gly Gin Val Val Ser 
245 250 255 

25 gtc ggt aaa att gcg gac ate tac gec aac tgc ggt ate acc aaa aaa 816 
Val Gly Lys lie Ala Asp He Tyr Ala Asn Cys Gly He Thr Lys Lys 
260 265 270 

gtg aaa gcg act ggc ctg gac gcg ctg ttt gac gee acc ate aaa gag 864 
30 Val Lys Ala Thr Gly Leu Asp Ala Leu Phe Asp Ala Thr He Lys Glu 
275 280 285 

atg aaa gaa gcg ggt gat aac acc ate gtc ttc acc aac ttc gtt gac 912 
Met Lys Glu Ala Gly Asp Asn Thr He Val Phe Thr Asn Phe Val Asp 
35 290 295 300 

ttc gac tct tec tgg ggc cac cgt cgc gac gtc gec ggt tat gec gcg 960 

Phe Asp Ser Ser Trp Gly His Arg Arg Asp Val Ala Gly Tyr Ala Ala 

305 310 31S 320 

40 

ggt ctg gaa ctg ttc gac cgc cgt ctg ccg gag ctg atg tct ctg ctg 1008 

Gly Leu Glu Leu Phe Asp Arg Arg Leu Pro Glu Leu Met Ser Leu Leu 
325 330 335 



45 cgc gat gac gac ate ctg ate etc acc get gac cac ggt tgc gat ccg 



1056 
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Arg Asp Asp Asp lie Leu lie Leu Thr Ala Asp His Gly Cys Asp Pro 
340 345 350 

acc tgg acc ggt act gac cac acg cgt gaa cac att ccg gta ctg gta 1104 
5 Thr Trp Thr Gly Thr Asp His Thr Arg Glu His lie Pro Val Leu Val 
355 360 365 

tat ggc ccg aaa gta aaa ccg ggc tea ctg ggt cat cgt gaa acc ttc 1152 
Tyr Gly Pro Lys Val Lys Pro Gly Ser Leu Gly His Arg Glu Thr Phe 
10 370 375 380 

gcg gat ate ggc cag act ctg gca aaa tat ttt ggt act tct gat atg 1200 
Ala Asp lie Gly Gin Thr Leu Ala Lys Tyr Phe Gly Thr Ser Asp Met 
385 390 395 400 

15 

gaa tat ggc aaa gec atg ttc tga 1224 
Glu Tyr Gly Lys Ala Met Phe 
405 

20 

<210> 6 
<211> 407 
<212> PRT 

<213> Escherichia coli 

25 

<400> 6 

Met Lys Arg Ala Phe lie Met Val Leu Asp Ser Phe Gly lie Gly Ala 
15 10 15 

30 Thr Glu Asp Ala Glu Arg Phe Gly Asp Val Gly Ala Asp Thr Leu Gly 
20 25 30 

His lie Ala Glu Ala Cys Ala Lys Gly Glu Ala Asp Asn Gly Arg Lys 
35 40 45 

35 

Gly Pro Leu Asn Leu Pro Asn Leu Thr Arg Leu Gly Leu Ala Lys Ala 
50 55 60 

His Glu Gly Ser Thr Gly Phe lie Pro Ala Gly Met Asp Gly Asn Ala 
40 65 70 75 80 

Glu Val lie Gly Ala Tyr Ala Trp Ala His Glu Met Ser Ser Gly Lys 
85 90 95 



45 Asp Thr Pro Ser Gly His Trp Glu lie Ala Gly Val Pro Val Leu Phe 



34 



100 



105 



110 



Glu Trp Gly Tyr Phe Ser Asp His Glu Asn Ser Phe Pro Gin Glu Leu 
115 120 125 

5 

Leu Asp Lys Leu Val Glu Arg Ala Asn Leu Pro Gly Tyr Leu Gly Asn 
130 135 . 140 

Cys His Ser Ser Gly Thr Val lie Leu Asp Gin Leu Gly Glu Glu His 
10 145 150 155 160 

Met Lys Thr Gly Lys Pro lie Phe Tyr Thr Ser Ala Asp Ser Val Phe 
165 170 175 

15 Gin lie Ala Cys His Glu Glu Thr Phe Gly Leu Asp Lys Leu Tyr Glu 
180 185 190 

Leu Cys Glu lie Ala Arg Glu Glu Leu Thr Asn Gly Gly Tyr Asn lie 
195 200 205 

20 

Gly Arg Val He Ala Arg Pro Phe lie Gly Asp Lys Ala Gly Asn Phe 
210 215 220 

Gin Arg Thr Gly Asn Arg His Asp Leu Ala Val Glu Pro Pro Ala Pro 
25 225 230 235 240 

Thr Val Leu Gin Lys Leu Val Asp Glu Lys His Gly Gin Val Val Ser 
245 250 255 

30 Val Gly Lys He Ala Asp He Tyr Ala Asn Cys Gly He Thr Lys Lys 
260 265 270 

Val Lys Ala Thr Gly Leu Asp Ala Leu Phe Asp Ala Thr He Lys Glu 
275 280 285 

35 

Met Lys Glu Ala Gly Asp Asn Thr He Val Phe Thr Asn Phe Val Asp 
290 295 300 

Phe Asp Ser Ser Trp Gly His Arg Arg Asp Val Ala Gly Tyr Ala Ala 
40 305 310 315 320 

Gly Leu Glu Leu Phe Asp Arg Arg Leu Pro Glu Leu Met Ser Leu Leu 
325 330 335 



45 Arg Asp Asp Asp He Leu He Leu Thr Ala Asp Hie Gly Cys Asp Pro 
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• 35 - 

340 345 350 



Thr Trp Thr Gly Thr 
355 

5 

Tyr Gly Pro Lys Val 
370 

Ala Asp lie Gly Gin 
10 385 



Asp His Thr Arg Glu 
360 

Lys Pro Gly Ser Leu 
375 

Thr Leu Ma Lys Tyr 
390 



His lie Pro Val Leu Val 
365 

Gly His Arg Glu Thr Phe 
380 

Phe Gly Thr Ser Asp Met 
395 400 



Glu Tyr Gly Lys Ala Met Phe 
405 

i 

15 



<210> 7 
<211> 780 
<212> DNA 
20 <213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (777) 

<400> 7 

atg act gat ctg aaa gca age age 
Met Thr Asp Leu Lys Ala Ser Ser 
1 5 

ctg aac acc ctg aat gac gac gac 
Leu Asn Thr Leu Asn Asp Asp Asp 
20 



ctg cgt gca ctg aaa ttg atg gac 48 
Leu Arg Ala Leu Lys Leu Met Asp 
10 15 

acc gac gag aaa gtg ate gec ctg 96 
Thr Asp Glu Lys Val lie Ala Leu 
25 30 



35 tgt cat cag gec aaa act ccg gtc ggc aat acc gec get ate tgt ate 144 
Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala He Cys He 
35 40 45 



tat cct cgc ttt ate ccg att get cgc aaa act ctg aaa gag cag ggc 192 
40 Tyr Pro Arg Phe He Pro He Ala Arg Lys Thr Leu Lys Glu Gin Gly 
50 55 60 



acc ccg gaa ate cgt ate get acg gta acc aac ttc cca cac ggt aac 
Thr Pro Glu He Arg He Ala Thr Val Thr Asn Phe Pro His Gly Asn 
45 65 70 75 80 



■ 
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gac gac ate gac ate gcg ctg gca gaa acc cgt gcg gca ate gec tac 288 
Asp Asp lie Asp lie Ala Leu Ala Glu Thr Arg Ala Ala lie Ala Tyr 
85 90 95 

5 ggt get gat gaa gtt gac gtt gtg ttc ccg tac cgc gcg ctg atg gcg 336 
Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 
100 105 110 

ggt aac gag cag gtt ggt ttt gac ctg gtg aaa gec tgt aaa gag get 384 
10 Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 
115 120 125 

tgc gcg gca gcg aat gta ctg ctg aaa gtg ate ate gaa acc ggc gaa 432 
Cys Ala Ala Ala Asn Val Leu Leu Lys Val lie lie Glu Thr Gly Glu 
15 130 135 140 

ctg aaa gac gaa gcg ctg ate cgt aaa gcg tct gaa ate tec ate aaa 480 

Leu Lys Asp Glu Ala Leu lie Arg Lys Ala Ser Glu lie Ser lie Lys 

145 150 155 160 

20 

gcg ggt gcg gac ttc ate aaa acc tct acc ggt aaa gtg get gtg aac 528 

Ala Gly Ala Asp Phe lie Lys Thr Ser Thr Gly Lys Val Ala Val Asn 
165 170 175 

25 gcg acg ccg gaa age gcg cgc ate atg atg gaa gtg ate cgt gat atg 576 
Ala Thr Pro Glu Ser Ala Arg lie Met Met Glu Val lie Arg Asp Met 
180 185 190 

ggc gta gaa aaa acc gtt ggt ttc aaa ccg gcg ggc ggc gtg cgt act 624 
30 Gly Val Glu Lys Thr Val Gly Phe Lys Pro Ala Gly Gly Val Arg Thr 
195 200 205 

gcg gaa gat gcg cag aaa tat etc gee att gca gat gaa ctg ttc ggt 672 
Ala Glu Asp Ala Gin Lys Tyr Leu Ala lie Ala Asp Glu Leu Phe Gly 
35 210 215 220 

get gac tgg gca gat gcg cgt cac tac cgc ttt ggc get tec age ctg 720 

Ala Asp Trp Ala Asp Ala Arg His Tyr Arg Phe Gly Ala Ser Ser Leu 
225 230 235 240 

40 

ctg gca age ctg ctg aaa gcg ctg ggt cac ggc gac ggt aag age gec 768 

Leu Ala Ser Leu Leu Lys Ala Leu Gly His Gly Asp Gly Lys Ser Ala 
245 250 255 



45 age age tac taa 



780 



llllllllll 
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37 



Ser Ser Tyr 



<210> 8 
5 <211> 259 
<212> PRT 

<213> Escherichia coli 
<400> 8 

10 Met Thr Asp Leu Lys Ala Ser Ser Leu Arg Ala Leu Lys Leu Met Asp 
15 10 15 

Leu Asn Thr Leu Asn Asp Asp Asp Thr Asp Glu Lys Val lie Ala Leu 
20 25 30 

15 

Cys His Gin Ala Lys Thr Pro Val Gly Asn Thr Ala Ala lie Cys lie 
35 40 45 

Tyr Pro Arg Phe lie Pro lie Ala Arg Lys Thr Leu Lys Glu Gin Gly 
20 50 55 60 

Thr Pro Glu lie Arg He Ala Thr Val Thr Asn Phe Pro His Gly Asn 
65 70 75 80 

25 Asp Asp He Asp He Ala Leu Ala Glu Thr Arg Ala Ala He Ala Tyr 

85 90 95 

Gly Ala Asp Glu Val Asp Val Val Phe Pro Tyr Arg Ala Leu Met Ala 
100 105 110 

30 

Gly Asn Glu Gin Val Gly Phe Asp Leu Val Lys Ala Cys Lys Glu Ala 
115 120 125 

Cys Ala Ala Ala Asn Val Leu Leu Lys Val He He Glu Thr Gly Glu 
35 130 135 140 

Leu Lys Asp Glu Ala Leu He Arg Lys Ala Ser Glu He Ser He Lys 
145 150 155 160 

40 Ala Gly Ala Asp Phe He Lys Thr Ser Thr Gly Lys Val Ala Val Asn 

165 170 175 

Ala Thr Pro Glu Ser Ala Arg He Met Met Glu Val lie Arg Asp Met 
180 185 190 

45 
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Gly Val Glu Lys Thr Val Gly Phe Lys Pro Ala Gly Gly Val Arg Thr 
195 200 205 

Ala Glu Asp Ala Gin Lys Tyr Leu Ala lie Ala Asp Glu Leu Phe Gly 
5 210 215 220 

Ala Asp Trp Ala Asp Ala Arg His Tyr Arg Phe Gly Ala Ser Ser Leu 
225 230 235 240 

10 Leu Ala Ser Leu Leu Lys Ala Leu Gly His Gly Asp Gly Lys Ser Ala 
245 250 255 

Ser Ser Tyr 



15 



<210> 9 
<211> 1080 
20<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
25 <222> (1) . . (1077) 

<400> 9 

atg tct aag att ttt gat ttc gta aaa cct ggc gta ate act ggt gat 48 

Met Ser Lys lie Phe Asp Phe Val Lys Pro Gly Val He Thr Gly Asp 
30 1 5 10 15 

gac gta cag aaa gtt ttc cag gta gca aaa gaa aac aac ttc gca ctg 96 
Asp Val Gin Lys Val Phe Gin Val Ala Lys Glu Asn Asn Phe Ala Leu 
20 25 30 



35 



cca gca gta aac tgc gtc ggt act gac tec ate aac gee gta ctg gaa 144 
Pro Ala Val Asn Cys Val Gly Thr Asp Ser He Asn Ala Val Leu Glu 
35 40 45 



40 acc get get aaa gtt aaa gcg ccg gtt ate gtt cag ttc tec aac ggt 
Thr Ala Ala Lys Val Lys Ala Pro Val He Val Gin Phe Ser Asn Gly 
50 55 60 



192 



ggt get tec ttt ate get ggt aaa ggc gtg aaa tct gac gtt ccg cag 
45 Gly Ala Ser Phe He Ala Gly Lys Gly Val Lys Ser Asp Val Pro Gin 



240 
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65 70 75 80 

ggt get get ate ctg ggc gcg ate tct ggt gcg cat cac gtt cac cag 288 
Gly Ala Ala lie Leu Gly Ala lie Ser Gly Ala His His Val His Gin 
5 85 90 95 

atg get gaa cat tat ggt gtt ccg gtt ate ctg cac act gac cac tgc 336 

Met Ala Glu His Tyr Gly Val Pro Val lie Leu His Thr Asp His Cys 

100 105 110 

10 

gcg aag aaa ctg ctg ccg tgg ate gac ggt ctg ttg gac gcg ggt gaa 384 
Ala Lys Lys Leu Leu Pro Trp lie Asp Gly Leu Leu Asp Ala Gly Glu 
115 120 125 

15 aaa cac ttc gca get acc ggt aag ccg ctg ttc tct tct cac atg ate 432 
Lys His Phe Ala Ala Thr Gly Lys Pro Leu Phe Ser Ser His Met lie 
130 135 140 

gac ctg tct gaa gaa tct ctg caa gag aac ate gaa ate tgc tct aaa 480 
20 Asp Leu Ser Glu Glu Ser Leu Gin Glu Asn lie Glu He Cys Ser Lys 
145 150 155 160 

tac ctg gag cgc atg tec aaa ate ggc atg act ctg gaa ate gaa ctg 528 
Tyr Leu Glu Arg Met Ser Lys He Gly Met Thr Leu Glu He Glu Leu 
25 165 170 175 

ggt tgc acc ggt ggt gaa gaa gac ggc gtg gac aac age cac atg gac 576 
Gly Cys Thr Gly Gly Glu Glu Asp Gly Val Asp Asn Ser His Met Asp 
180 185 190 

30 

get tct gca ctg tac acc cag ccg gaa gac gtt gat tac gca tac acc 624 
Ala Ser Ala Lieu Tyr Thr Gin Pro Glu Asp Val Asp Tyr Ala Tyr Thr 
195 200 205 

35 gaa ctg age aaa ate age ccg cgt ttc acc ate gca gcg tec ttc ggt 672 
Glu Leu Ser Lys He Ser Pro Arg Phe Thr He Ala Ala Ser Phe Gly 
210 215 220 



720 



aac gta cac ggt gtt tac aag ccg ggt aac gtg gtt ctg act ccg acc 

40 Asn Val His Gly Val Tyr Lys Pro Gly Asn Val Val Leu Thr Pro Thr 
225 230 235 240 

ate ctg cgt gat tct cag gaa tat gtt tec aag aaa cac aac ctg ccg 768 

He Leu Arg Asp Ser Gin Glu Tyr Val Ser Lys Lys His Asn Leu Pro 
45 245 250 255 
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cac aac age ctg aac ttc gta ttc cac ggt ggt tec ggt tct act get 

His Asn Ser Leu Asn Phe Val Phe His Gly Gly Ser Gly Ser Thr Ala 
260 265 270 



816 



5 cag gaa ate aaa gac tec gta age tac ggc gta gta aaa atg aac ate 864 
Gin Glu lie Lys Asp Ser Val Ser Tyr Gly Val Val Lys Met Asn lie 
275 280 285 

gat ace gat ace caa tgg gca ace tgg gaa ggc gtt ctg aac tac tac 912 
10 Asp Thr Asp Thr Gin Trp Ala Thr Trp Glu Gly Val Leu Asn Tyr Tyr 
290 295 300 



aaa gcg aac gaa get tat ctg cag ggt cag ctg ggt aac ccg aaa ggc 
Lys Ala Asn Glu Ala Tyr Leu Gin Gly Gin Leu Gly Asn Pro Lys Gly 
15 305 310 315 320 



960 



20 



gaa gat cag ccg aac aag aaa tac tac gat ccg cgc gta tgg ctg cgt 1008 
Glu Asp Gin Pro Asn Lys Lys Tyr Tyr Asp Pro Arg Val Trp Leu Arg 
325 330 335 

gee ggt cag act teg atg ate get cgt ctg gag aaa gca ttc cag gaa 1056 
Ala Gly Gin Thr Ser Met lie Ala Arg Leu Glu Lys Ala Phe Gin Glu 
340 345 350 



25 ctg aac gcg ate gac gtt ctg taa 
Leu Asn Ala lie Asp Val Leu 
355 



1080 



30<210> 10 
<211> 359 
<212> PRT 

<213> Escherichia coli 
35 <400> 10 

Met Ser Lys He Phe Asp Phe Val Lys Pro Gly Val He Thr Gly Asp 
15 10 15 

Asp Val Gin Lys Val Phe Gin Val Ala Lys Glu Asn Asn Phe Ala Leu 
40 20 25 30 

Pro Ala Val Asn Cys Val Gly Thr Asp Ser He Asn Ala Val Leu Glu 
35 40 45 



45 Thr Ala Ala Lys Val Lys Ala Pro Val He Val Gin Phe Ser Asn Gly 
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50 55 60 

Gly Ala Ser Phe He Ala Gly Lys Gly Val Lys Ser Asp Val Pro Gin 
65 70 75 80 

5 

Gly Ala Ala lie Leu Gly Ala He Ser Gly Ala His His Val His Gin 
85 90 95 

Met Ala Glu His Tyr Gly Val Pro Val lie Leu His Thr Asp His Cys 
10 100 105 110 

Ala Lys Lys Leu Leu Pro Trp He Asp Gly Leu Leu Asp Ala Gly Glu 
115 120 125 

15 Lys His Phe Ala Ala Thr Gly Lys Pro Leu Phe Ser Ser His Met He 
130 135 140 

Asp Leu Ser Glu Glu Ser Leu Gin Glu Asn He Glu He Cys Ser Lys 
145 150 155 160 

20 

Tyr Leu Glu Arg Met Ser Lys He Gly Met Thr Leu Glu He Glu Leu 
165 170 175 

Gly Cys Thr Gly Gly Glu Glu Asp Gly Val Asp Asn Ser His Met Asp 
25 180 185 190 

Ala Ser Ala Leu Tyr Thr Gin Pro Glu Asp Val Asp Tyr Ala Tyr Thr 
195 200 205 

30 Glu Leu Ser Lys He Ser Pro Arg Phe Thr He Ala Ala Ser Phe Gly 
210 215 220 

Asn Val His Gly Val Tyr Lys Pro Gly Asn Val Val Leu Thr Pro Thr 
225 230 235 240 

35 

He Leu Arg Asp Ser Gin Glu Tyr Val Ser Lys Lys His Asn Leu Pro 
245 250 255 

His Asn Ser Leu Asn Phe Val Phe His Gly Gly Ser Gly Ser Thr Ala 
40 260 265 270 

Gin Glu He Lys Asp Ser Val Ser Tyr Gly Val Val Lys Met Asn He 
275 280 285 



45 Aep Thr Asp Thr Gin Trp Ala Thr Trp Glu Gly Val Leu Asn Tyr Tyr 
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290 



295 



300 



Lys Ala Asn Glu Ala Tyr Leu Gin Gly Gin Leu Gly Asn Pro Lys Gly 
305 310 315 320 

5 

Glu Asp Gin Pre Asn Lys Lys Tyr Tyr Asp Pro Arg Val Trp Leu Arg 
325 330 335 

Ala Gly Gin Thr Ser Met lie Ala Arg Leu Glu Lys Ala Phe Gin Glu 
10 340 345 350 

Leu Asn Ala He Asp Val Leu 
355 



15 



<210> 11 
<211> 921 
<212> DNA 
20 <213> Salmonella typhi 

<220> 

<221> CDS 

<222> (1) . . (918) 



25 



30 



<400> 11 

atg gat ate gcg gtt att ggc tct aac atg gtg gac ctt ate ace tac 48 
Met Asp He Ala Val He Gly Ser Asn Met Val Asp Leu He Thr Tyr 
1 5 10 15 

acc aac cag atg ccc aaa gaa ggg gaa act ctg gaa gcg ccg gcg ttt 96 
Thr Asn Gin Met Pro Lys Glu Gly Glu Thr Leu Glu Ala Pro Ala Phe 
20 25 30 



35 aaa ate ggc tgc ggc gga aaa ggg gcg aac cag gec gtg gcg gec get 
Lys He Gly Cys Gly Gly Lys Gly Ala Asn Gin Ala Val Ala Ala Ala 
35 40 45 



144 



aag etc aat tea aaa gta ttg atg ttg acc aaa gtg ggc gac gat att 
40 Lys Leu Asn Ser Lys Val Leu Met Leu Thr Lys Val Gly Asp Asp He 
50 55 60 



192 



ttt gec 
Phe Ala 
45 65 



gac aac acc att cgt aat etc gaa tec tgg ggg ate aat acg 
Asp Asn Thr He Arg Asn Leu Glu Ser Trp Gly He Asn Thr 
70 75 80 



240 
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acg tat gta gaa aaa gta ccg tgt acc age age ggc gta gcg ccg att 288 

Thr Tyr Val Glu Lys Val Pro Cys Thr Ser Ser Gly Val Ala Pro lie 
85 90 95 

5 ttc gtc aac gec aac tec age aac age att ctg ate ate aaa ggc get 336 
Phe Val Asn Ala Asn Ser Ser Asn Ser lie Leu lie lie Lys Gly Ala 
100 105 110 

aac aag ttt etc teg ccg gaa gat ate gat cgc gcg gcg gaa gat tta 384 
10 Asn Lys Phe Leu Ser Pro Glu Asp lie Asp Arg Ala Ala Glu Asp Leu 
115 120 125 

aaa aaa tgc cag ctt att gtt ctg caa ctg gaa gtt cag ctt gaa acg 432 
Lys Lys Cys Gin Leu lie Val Leu Gin Leu Glu Val Gin Leu Glu Thr 
15 130 135 140 

gtt tat cac gca ata gaa ttt ggc aag aaa cac ggg att gaa gtg tta 480 

Val Tyr His Ala lie Glu Phe Gly Lys Lys His Gly lie Glu Val Leu 
145 150 155 160 

20 

tta aac cct gcg cca gca tta egg gaa tta gat atg tct tat gee tgt 528 
Leu Asn Pro Ala Pro Ala Leu Arg Glu Leu Asp Met Ser Tyr Ala Cys 
165 170 175 

25 aaa tgc gat ttc ttt gta cct aat gaa acc gag ctg gaa ata tta acc 576 
Lys Cys Asp Phe Phe Val Pro Asn Glu Thr Glu Leu Glu lie Leu Thr 
180 185 190 

ggt atg cca gtg gat acc tat gac cat att cgc gca gcg gca cgt teg 624 
30 Gly Met Pro Val Asp Thr Tyr Asp His lie Arg Ala Ala Ala Arg Ser 
195 200 205 

ctg gta gat aaa ggg ctg aac aat att att gtc acc atg ggc gag aaa 672 
Leu Val Asp Lys Gly Leu Asn Asn He He Val Thr Met Gly Glu Lys 
35 210 215 220 

ggc gcg ctg tgg atg acg cgt gac cag gaa gtc cat gtt ccg gcg ttt 72 0 
Gly Ala Leu Trp Met Thr Arg Asp Gin Glu Val His Val Pro Ala Phe 
225 230 235 240 

40 

aga gtg aac get gtt gat acc age ggc gcg ggc gat gee ttt ate ggc 768 
Arg Val Asn Ala Val Asp Thr Ser Gly Ala Gly Asp Ala Phe He Gly 
245 250 255 



45 tgt ttc gcg cat tac tac gtc cag age ggg gat gtg gaa gee gee atg 



816 
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Cys Phe Ala His Tyr Tyr Val Gin Ser Gly Asp Val Glu Ala Ala Met 
260 265 270 



aaa aaa gee gtc etc ttt gee get ttc age gtc ace ggg aaa ggc ace 
5 Lys Lys Ala Val Leu Phe Ala Ala Phe Ser Val Thr Gly Lys Gly Thr 
275 280 285 



864 



caa tec tct tat cca age att gag caa ttt aat gag tat ctt teg ttg 
Gin Ser Ser Tyr Pro Ser lie Glu Gin Phe Asn Glu Tyr Leu Ser Leu 
10 290 295 300 



912 



aac gaa taa 
Asn Glu 
305 



921 



15 



<210> 12 
<211> 306 
<212> PRT 
20<213> Salmonella typhi 

<400> 12 

Met Asp He Ala Val He Gly Ser Asn Met Val Asp Leu He Thr Tyr 
15 10 15 

25 

Thr Asn Gin Met Pro Lys Glu Gly Glu Thr Leu Glu Ala Pro Ala Phe 
20 25 30 

Lys He Gly Cys Gly Gly Lys Gly Ala Asn Gin Ala Val Ala Ala Ala 
30 35 40 45 

Lys Leu Asn Ser Lys Val Leu Met Leu Thr Lys Val Gly Asp Asp He 
50 55 60 

35 Phe Ala Asp Asn Thr He Arg Asn Leu Glu Ser Trp Gly He Asn Thr 
65 70 75 80 



40 



Thr Tyr Val Glu Lys Val Pro Cys Thr Ser Ser Gly Val Ala Pro He 
85 90 95 

Phe Val Asn Ala Asn Ser Ser Asn Ser He Leu He He Lys Gly Ala 
100 105 110 



Asn Lys Phe Leu Ser Pro Glu Asp He Asp Arg Ala Ala Glu Asp Leu 
45 115 120 125 
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Lys Lys Cys Gin Leu lie Val Leu Gin Leu Glu Val Gin Leu Glu Thr 
130 135 140 

Val Tyr His Ala lie Glu Phe Gly Lys Lys His Gly lie Glu Val Leu 
5 145 150 155 160 

Leu Asn Pro Ala Pro Ala Leu Arg Glu Leu Asp Met Ser Tyr Ala Cys 
165 170 175 

10 Lys Cys Asp Phe Phe Val Pro Asn Glu Thr Glu Leu Glu lie Leu Thr 
180 185 190 

Gly Met Pro Val Asp Thr Tyr Asp His lie Arg Ala Ala Ala Arg Ser 
195 200 205 

15 

Leu Val Asp Lys Gly Leu Asn Asn lie He Val Thr Met Gly Glu Lys 
210 215 220 

Gly Ala Leu Trp Met Thr Arg Asp Gin Glu Val His Val Pro Ala Phe 
20 225 230 235 240 

Arg Val Asn Ala Val Asp Thr Ser Gly Ala Gly Asp Ala Phe He Gly 
245 250 255 

25 Cys Phe Ala His Tyr Tyr Val Gin Ser Gly Asp Val Glu Ala Ala Met 
260 265 270 

Lys Lys Ala Val Leu Phe Ala Ala Phe Ser Val Thr Gly Lys Gly Thr 
275 280 285 

30 

Gin Ser Ser Tyr Pro Ser He Glu Gin Phe Asn Glu Tyr Leu Ser Leu 
290 295 300 

Asn Glu 
35 305 



<210> 13 

40 <211> 483 

<:212> DNA 

<213> Lactobacillus leichmannii 



<220> 
46 <221> CDS 
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<222> (10) . . (480) 



<400> 13 

gtatactaa atg cca aaa aag acg ate tac ttc ggt gec ggc tgg ttc act 51 
5 Met Pro Lys Lys Thr lie Tyr Phe Gly Ala Gly Trp Phe Thr 

15 10 



gac cgc caa aac aaa gec tac aag gaa gec atg gaa gec etc aag gaa 
Asp Arg Gin Asn Lys Ala Tyr Lys Glu Ala Met Glu Ala Leu Lys Glu 
10 15 20 25 30 



99 



15 



aac cca acg att gac ctg gaa aac age tac gtt ccc ctg gac aac cag 147 
Asn Pro Thr lie Asp Leu Glu Asn Ser Tyr Val Pro Leu Asp Asn Gin 
35 40 45 

tac aag ggt ate egg gtt gat gaa cac ccg gaa tac ctg cat gac aag 195 
Tyr Lys Gly lie Arg Val Asp Glu His Pro Glu Tyr Leu His Asp Lys 
50 55 60 



20 gtt tgg get acg gee ace tac aac aac gac ttg aac ggg ate aag ace 
Val Trp Ala Thr Ala Thr Tyr Asn Asn Asp Leu Asn Gly lie Lys Thr 
65 70 75 



243 



aac gac ate atg ctg ggt gtc tac ate cct gac gaa gaa gac gtc ggc 
25 Asn Asp lie Met Leu Gly Val Tyr lie Pro Asp Glu Glu Asp Val Gly 
80 85 90 



291 



ctg ggc atg gaa ctg ggt tac gec ttg age caa ggc aag tac gtc ctt 
Leu Gly Met Glu Leu Gly Tyr Ala Leu Ser Gin Gly Lys Tyr Val Leu 
30 95 100 105 110 



339 



35 



ttg gtc ate ccg gac gaa gac tac ggc aag ccg ate aac etc atg age 387 

Leu Val lie Pro Asp Glu Asp Tyr Gly Lys Pro lie Asn Leu Met Ser 
115 120 125 

tgg ggc gtc age gac aac gtg ate aag atg age cag ctg aag gac ttc 435 

Trp Gly Val Ser Asp Asn Val He Lys Met Ser Gin Leu Lys Asp Phe 
130 135 140 



40 aac ttc aac aag ccg cgc ttc gac ttc tac gaa ggt gec gta tac taa 
Asn Phe Asn Lys Pro Arg Phe Asp Phe Tyr Glu Gly Ala Val Tyr 
145 150 155 



483 



45 <210> 14 



mm 
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<211> 157 
<212> PRT 

<213> Lactobacillus leichmannii 
5 <400> 14 

Met Pro Lys Lys Thr He Tyr Phe Gly Ala Gly Trp Phe Thr Asp Arg 
15 10 15 

Gin Asn Lys Ala Tyr Lys Glu Ala Met Glu Ala Leu Lys Glu Asn Pro 
10 20 25 30 

Thr He Asp Leu Glu Asn Ser Tyr Val Pro Leu Asp Asn Gin Tyr Lys 
35 40 45 

15 Gly He Arg Val Asp Glu His Pro Glu Tyr Leu His Asp Lys Val Trp 
50 55 60 

Ala Thr Ala Thr Tyr Asn Asn Asp Leu Asn Gly He Lys Thr Asn Asp 
65 70 75 80 

20 

He Met Leu Gly Val Tyr He Pro Asp Glu Glu Asp Val Gly Leu Gly 
85 50 95 

Met Glu Leu Gly Tyr Ala Leu Ser Gin Gly Lys Tyr Val Leu Leu Val 
25 100 105 110 

He Pro Asp Glu Asp Tyr Gly Lys Pro He Asn Leu Met Ser Trp Gly 
115 120 125 

30 Val Ser Asp Asn Val He Lys Met Ser Gin Leu Lys Asp Phe Asn Phe 
130 135 140 

Asn Lys Pro Arg Phe Asp Phe Tyr Glu Gly Ala Val Tyr 
145 150 155 

35 
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A method for the in vitro enzymatic synthesis of 
deoxyribonucleosides comprising reacting deoxyribose 1 -phosphate 
(dRIP) and a nucleobase, wherein a deoxyribonucleoside and 
inorganic phosphate are formed. 

The method of claim 1 , wherein the inorganic phosphate is removed. 



10 



The method of claim 1 or 2, wherein the reaction is catalyzed by a 
thymidine phosphorylase (TP, EC 2.4.2.4) or a purine nucleoside 
phosphorylase (PNP, EC 2.4.2.1). 



4. The method of any one of the previous claims, wherein the 
15 nucleobase is selected from the group consisting of thymine, uracil, 

adenine, guanine and hypoxanthine and analogs thereof , e.g. 2-thio- 
uracil, 6-aza-uracil, 5-carboxy-2-thio-uracil, 6-aza-thymine, 6-aza-2- 
thio-thymine and 2,6-diamino-purine. 



20 5. The method of any one of the previous claims, wherein the removal 
of the inorganic phosphate is effected by (i) conversion to inorganic 
pyrophosphate, (ii) precipitation, (Hi) complexation and/or (iv) 
substrate phosphorylation. 



25 6. The method of claim 5, wherein the inorganic phosphate is converted 
to pyrophosphate by a phosphate transfer from f ructose-diphosphate 
(FDP) under formation of fructose-6-phosphate (F6P). 



30 



7. 



The method of claim 6, wherein the phosphate transfer is catalyzed 
by a PPi-dependent phosphofructokinase (PFK-PPi, EC 2.7.1.90). 
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8. The method of claim 6 or 7, wherein the inorganic pyrophosphate is 
removed by precipitation. 

9. The method of claim 5, wherein the inorganic phosphate is 
5 transferred to a disaccharide, particularly sucrose or maltose under 

formation of a monosaccharide and a phosphorylated 
monosaccharide. 

10. The method of claim 9, wherein the phosphate transfer is catalyzed 
10 by a sucrose phosphorylase (EC 2.4.1 .7) or a maltose phosphorylase 

(EC 2.4.1.8). 

11. The method of claim 10, wherein the phosphorylated 
monosaccharide is further reacted. 

15 

12. The method of any one of the previous claims, wherein the 
deoxyribose- 1 -phosphate is generated from deoxyribose 5-phosphate 
(dR5P). 

20 13. The method of claim 12, wherein the reaction is catalyzed by a 
deoxyribomutase (EC 2.7.5.1 ) or a phosphopentose mutase (PPM, EC 
5.4.2.7). 

1 4. The method of claim 1 2 or 1 3, wherein the deoxyribose-5-phosphate 
25 is generated by a condensation of glyceraldehyde 3-phosphate (GAP) 

with acetaldehyde. 

15. The method of claim 14, wherein the reaction is catalyzed by a 
phosphopentose aldolase (PPA, EC 4.1.2.4). 

30 
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16. The method of claim 14 or 15, wherein the glyceraldehyde 3- 
phosphate is generated from fructose 1 ,6-diphosphate, 
dihydroxyacetone (DHA) and/or glycerophosphate. 

5 1 7. The method of claim 1 6, wherein the glyceraldehyde 3-phosphate is 
generated from fructose 1 ,6-diphosphate in a reaction catalyzed by 
an FDP-aldolase (EC 4.1.2.13) selected from FDP-aldolases I and 
FDP-aldolases II. 

10 18. The method of claim 1 6, wherein the glyceraldehyde 3-phosphate is 
generated from dihydroxyacetone and ATP under formation of 
dihydroxyacetone phosphate (DHAP) and ADP and subsequent 
isomerization of DHAP to GAP in a reaction catalyzed by a 
glycerokinase (GK, EC 2.7.1.30) and a triose phosphate isomerase 

15 (TIM, EC 5.3.1.1). 

1 9. The method of claim 1 6, wherein the glyceraldehyde 3-phosphate is 
generated from glycerol phosphate (GP) and 0 2 under formation of 
dihydroxyacetone phosphate (DHAP) and H 2 0 2 and subsequent 
20 isomerization of DHAP to GAP in a reaction catalyzed by a 

glycerophosphate oxidase (GPO, EC 1 . 1 .3.21 ) and a triose phosphate 
isomerase (TIM, EC 5.3.1.1). 

20. The method of claim 1 2 or 1 3, wherein the deoxyribose 5-phosphate 
25 is generated by a phosphorylation of deoxyribose. 

21. The method of claim 20, wherein the reaction is catalyzed by a 
deoxyribokinase (dRK, EC 2.7.1.15). 

30 22. The method of claim 21, wherein a dRK obtainable from Salmonella 
typhi is used which is encoded by (a) the nucleotide sequence shown 
in SEQ ID NO. 11 or its complementary sequence, (b) a nucleotide 
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sequence corresponding to the sequence of (a) in the scope of the 
degeneracy of the genetic code or (c) a nucleotide sequence 
hybridizing under stringent conditions to the sequence of (a) and/or 
(b). 

23. The method of any one of the previous claims, wherein a 
deoxyribonucleoside containing a first nucleobase is further reacted 
with a second nucleobase under formation of a deoxyribonucleoside 
containing the second nucleobase. 

24. The method of claim 23, wherein said second nucleobase is selected 
from cytidine and analogs thereof, e.g. 5-aza-cytidine, 2,6-dichloro- 
purine, 6-chloro-guanine, 6-chIoro-purine, 6-aza-thymine and 5-fluoro- 
uracil. 

25. The method of claim 24, wherein the reaction is catalyzed by a 
nucleoside 2-deoxyribosyl transferase {NdT, EC 2.4.2.6). 



26. The method of claim 25, wherein an NdT obtainable from 
20 Lactobacillus leichmannii is used which is encoded by (a) the 

nucleotide sequence shown in SEQ ID NO. 13 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence 
of (a) in the scope of the degeneracy of the genetic code or (c) a 
nucleotide sequence hybridizing under stringent conditions to the 
25 sequence of (a) and/or (b). 



27. A method for the in vitro enzymatic synthesis of 
deoxyribonucleosides comprising the steps of: 

(i) condensing glyceraldehyde 3-phosphate (GAP) with 
30 acetaldehyde to deoxyribose 5-phosphate (dR5P), 

(ii) isomerizing deoxyribose 5-phosphate to deoxyribose 1 - 
phosphate (dRIP) and 



-08^1999 



6 28. 



28. 

10 

30. 



15 31. 



32. 

20 



33. 

25 
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(iii) reacting deoxyribose 1 -phosphate and nudeobase, 

wherein a deoxyribonucleoside and inorganic phosphate 
are formed. 

The method of claim 27 ', wherein the reaction is carried out without 
isolating intermediate products. 

The method of claim 27 or 28, wherein the glyceraldehyde 3- 
phosphate (GAP) is generated from fructose 1 ,6-diphosphate (FDP), 
dihydroxy-acetone (DHA) and/or glycerolphosphate (GP). 

The method of claims 27 to 29, wherein before step (ii) excess 
acetaldehyde is removed. 

The method of claims 27 to 30, wherein before step (ii) excess 
starting materials and/or by-products, particularly fructose 1,6- 
diphosphate and/or deoxyxylulose 1 -phosphate (dX1 P) are removed. 

The method of claims 27 to 30, wherein the reaction is carried out 
in a manner that before step (ii) no substantial amounts of starting 
materials and/or by-products, particularly fructose 1 ,6-diphosphate 
and/or deoxyxylulose 1 -phosphate are present. 

A method for the in vitro enzymatic synthesis of 
deoxyribonucleosides comprising the steps of: 

(i) phosphorylating deoxyribose to deoxyribose 5- 
phosphate, 

(ii) isomerizing deoxyribose 5-phosphate to deoxyribose 1 - 
phosphate and 

(Hi) reacting deoxyribos 1 -phosphate and nudeobase, 

wherein a deoxyribonucleoside an inorganic phosphate 
are formed. 
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34. The method of claim 33, wherein the reaction is carried out without 
isolating intermediate products. 

35. The method of claims 27 to 34, wherein the inorganic phosphate is 
5 removed. 

36. The method of any one of the previous claims comprising further 
reacting said deoxyribonucleoside. 

10 37. The method of claim 36, wherein said further reacting comprises the 
synthesis of deoxyribonucleoside mono-, di- or triphosphates, of H- 
phosphonates or of phosphoramidites. 

38. The use of an isolated nucleic acid molecule encoding a nucleoside 
15 2-deoxyribosyl transferase (NdT, EC 2.4.2.6) for the preparation of 

an enzyme for an in vitro method for the enzymatic synthesis of 
deoxyribonucleosides, wherein a deoxyribonucleoside containing a 
first nucleobase is further reacted with a second nucleobase under 
formation of a deoxyribonucleoside containing the second 

20 nucleobase, wherein said nucleic acid molecule comprises (a) the 

nucleotide sequence shown in SEQ ID NO. 13 or its complementary 
sequence, (b) a nucleotide sequence corresponding to the sequence 
of (a) in the scope of the degeneracy of the genetic code or (c) a 
nucleotide sequence hybridizing under stringent conditions to the 

25 sequence of (a) and/or (b). 

39. The use of claim 38, wherein the second nucleobase is selected from 
cytidine and analogs thereof, e.g. 6-methyl purine, 2-amino-6- 
methylmercaptopurine, 6-dimethylaminopurine, 5-azacytidine, 2,6- 

30 dichloropurine, 6-chloroguanine, 6-chloropurine, 6-azathymine, 5- 

fluorouracil, ethyl-4-amino-5-imidazol carboxylate, imidazol-4- 
carboxamide and 1 ,2,4-triazole-3-carboxamide. 



< 
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40. The use of claim 38 or 39, wherein the first nucleobase is selected 
from adenine, guanine, thymine, uracil and hypoxanthine. 

41 . The use of any one of claims 38-40, wherein the nucleic acid 
molecule is contained on a recombinant vector in operative linkage 
with an expression control sequence. 

42. The use of any one of claims 38-41, wherein the nucleic acid is 
contained in a recombinant cell. 

43. Use of an isolated polypeptide having NdT activity for the preparation 
of nucleosides according to claim 24. 



44. Use of an isolated nucleic acid molecule encoding a deoxyribokinase 
is (dRK, EC 2.7.1.5) for the preparation of an enzyme for an in vitro 

method for the enzymatic synthesis of deoxyribonucleosides 
comprising the step of phosphorylating deoxyribose to deoxyribose 
5-phosphate, wherein said nucleic acid molecule comprises (a) the 
nucleotide sequence shown in SEQ ID NO.1 1 or its complementary 
20 sequence, (b) a nucleotide sequence corresponding to the sequence 

of (a) in the scope of the degeneracy of the genetic code or (c) a 
nucleotide sequence hybridizing under stringent conditions to the 
sequence of (a) and/or (b). 



25 45. Use of an isolated polypeptide having dRK activity for an in vitro 
method for the enzymatic synthesis of deoxyribonucleosides 
comprising the step of phosphorylating deoxyribose to deoxyribose 
5-phosphate. 



30 



46. 



Recombinant bacteria strains deposited at CNCM under accession 
numbers 1-2186, 1-2187, 1-2188, 1-2189, 1-2190 and 1-2191. 
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Abstract 



The present invention relates to a method for the in vitro enzymatic 
5 synthesis of deoxyribonucleosides and enzymes suitable for this method. 
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